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ABSTRACT 

An extensive sample of galaxy clusters will be available in the coming years, detected through their 
Sunyaev-Zeldovich effect (SZE) . We use a semi-analytic model to study the scientific yield of combining 
SZE data with X-ray and optical follow-up observations. If clusters at a given redshift z Q can be identified 
with virialized, spherical halos, they populate a well-defined "fundamental plane" (FP) in the parameter 
space of the three observables: virial temperature (T), total Sunyaev-Zeldovich flux decrement (AS„), 
and angular size (9). The location and orientation of the FP, as well as its redshift-evolution, are 
sensitive to both the internal evolution of clusters, and to the underlying cosmological parameters. We 
show that if clusters are not standard candles (e.g. due to feedback, or energy injection), then this can 
be inferred from the FP. Likewise, we study the dependence of the FP on the cosmological parameters 
h, (78: and firji and quantify constraints on these parameters. We also show that in the absence of any 
non-gravitational effects, the scatter in the the (AS U — T) plane is significantly smaller than in either 
the (9 — T) or the (9 — AS V ) planes. As a result, the (AS V — T) relation can be an exceptionally sensitive 
probe of both cluster physics and cosmological parameters, and a comparison of the amount of scatter in 
these three scaling relations will serve as a test of the origin (cosmological vs. stochastic) of the scatter. 

Subject headings: Cosmology: theory, cosmological parameters, galaxies: clusters: general 



1. INTRODUCTION 

Galaxy clusters provide a uniquely useful probe of 
the fundamental cosmological parameters. The observed 
abundance of nearby clusters constrains the amplitude as 
and sl ope n of the primordial power spectrum on cluster 
scales flEvrard 1989|,|Hcnry fc Arnaud I99l|, |Bahcall fc Gen 



1992 



1996 



White, Efstathiou & Frenk 19931, Eke Cole & Front 



Picrpaoli Scott fc White 2000|), while the cluster 
mass junction ( [Liljc 1992] , pahcall fc Ccn 1993|, |Viana &| 
Liddle 1996 ) , and its redshift- evo l ution (|Oukbir 



Blan- 



chard 1992[ pahcall fc Fan 1998|, pianchard fc Bartlctl 



1998| , |Viana fc Liddle 1999| , [Willick 200q ), places usefu 
constrains on the density parameters Qq and as well 
as on the equation of state of the dark energy componen t 
(|Wang fc Steinhardt 199"g|, |Haiman, Mohr fc Holder 200l|). 



The analyses above require an understanding of the in- 
ternal physics of galaxy clusters, and the extension of the 
physical properties of the local cluster sample towards 
higher rcdshifts. In other words, for cosmological stud- 
ies, one would ideally like to use clusters as "standard 
candles". The central assumptions that make this possi- 
ble are that clusters are virialized objects at characteristic 
background densities scaling with redshift as p oc (1 + z) 3 . 
Under these assumptions, the temperatures, masses and 
redshifts of clusters are related by the virial theorem. As 
demonstrated recently by Haiman, Mohr & Holder (2001), 
the use of large future galaxy cluster surveys for cosmolog- 
ical studies is likely to be limited by the validity of these 
assumptions, rather than by statistical uncertainties. 



Galaxy clusters exhibit various useful scaling relations, 
such as those between mass and t emperature (M — T; e.g. 



Mohr, Mathicscn fc Evrard 1999|), luminosity and temper- 
ature (L — T; e.g., Eke, Navarro fc Frenk 1998|), o r size and 
temperature (R - T; e.g., pVIohr fc Evrard 1997|). Analy- 



ses of these, and other, similar scalings laws have yielded 
insight into the physical nature of clusters. In addition, 
the scaling relations and their scatter also contain useful 
cosmological information ( Verde et al. 2000). The temper- 
ature of the hot intra-cluster medium (ICM) is a uniquely 
direct measure that can be accurately determined from X— 
ray spectra. On the other hand, the (X-ray) luminosity 
is sensitive to the density profiles near the cluster core, 
and the measurements of the masses (from gravitational 
lensing, or from the velocity dispersion of member galax- 
ies) and radii (e.g., from X-ray isophotes) typically suffer 
from systematic uncertainties and are thus less robust. 

A direct probe of the hot gas in galaxy clusters is pro- 
vided by the so-called Sun yaev-Zeldovich effect (SZE, 
Sunyaev fc Zcldovich 1980| ) . Cosmic microwave back- 



ground (CMB) photons interact with the hot ionized 
intra-cluster gas along their path, distorting the CMB 
spectrum. In the Rayleigh- Jeans regime, the distortion 
results in a "decrement" (SZD) of the CMB temperature 
that depends only on the optical depth to Compton scat- 
tering and the cluster temperature, but not on the redshift 
of the cluster. While the SZE has currently been measured 
only in a handful of clusters, future SZE surveys can de- 
tect rich clusters up to arbitrary redshifts; for example, 
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a recently initiated interferometric survey (Holder et al 
20000 covering 12 deg 2 , will detect hundreds of clusters 
below redshift z — 3. 

The SZE can be characterized in two ways: by its "sur- 
face brightness" (i.e. the value of the SZ decrement per 
unit solid angle) near the central region of the cluster, or 
by its total flux decrement (i.e. the decrement integrated 
over the whole solid angle of the cluster). While at present 
SZE studies have mainly used the central SZ decrement, 
here we propose to use the total SZ flux decrement. This 
quantity should be a more robust indicator of global clus- 
ter properties and has a different dependence on cosmo- 
logical parameters than the central decrement. However, 
the total decrement has been difficult to measure to date. 
Single dish experiments can, in principle, measure the to- 
tal decrement, but in practice, the need to subtract large 
backgrounds have resulted in limited signal to noise (Ma- 
son et al. 2001; Myers et al. 1997). The most precise SZE 
observations have so far been obtained with interferome- 
ters. Interferometric (Joy et al. 2001; Kneissl et al. 2001; 
Grego et al. 2000; Reese et al. 2000; Carlstrom et al. 
2000; Jones et al. 2001; Birkinshaw 1998 and references 
therein) observations offer several advantages over obser- 
vations made with single dishes, in terms of better control 
over systematics and contamination effects. However, at 
present, the angular size that can be imaged is small, rela- 
tive to the angular size of the massive clusters that can be 
studied at the current sensitivities. Hence, these observa- 
tions typically do not cover the whole cluster, and instead 
measure only the central decrement. 

The prospects for measuring the total SZ flux decrement 
will dramatically improve in the next few years. Forthcom- 
ing SZE (e.g., DASI, CBI) and cosmic microwave back- 
ground (CMB) experiments will not only measure the to- 
tal SZD of an extended cluster sample, but, in many cases, 
will also resolve the temperature fluctuation over the an- 
gular size of the whol e cluster 



For example MAP (Bennett et al. 1995) will detect tens 
of clusters and resolve a few of them; Planck (Bcrsanelli et 
al. 1996; Mandolesi et al. 1998) will detect ~ 10 4 clu sters 
and resolve and measure the total SZD of 1% of them flKayJ 
Liddle & Thomas 2001). An experiment such as the pro- 



posed Center for study of Cosmic structure (CSCS; Page 
at al. 2001) could map the CMB anisotropy over 100deg 2 
with a resolution of 1.7'. This would allow all galaxy clus- 
ters of masses above 4 x 10 14 M© in the CMB map region 
to be detected through the SZ effect. The experiment 
would also determine spectroscopic redshift of more that 
300 clusters, and implement X-ray follow up to determine 
their X-ray temperatures. 

In this paper, we investigate the scientific potential of 
measuring the cluster total SZ flux decrement in the con- 
text of a well studied cluster sample for which X-ray and 
optical follow-up are available. As illustrated in § 3, the 
total SZ flux decrement (AS),) is an integrated quantity 
over the whole cluster, not just along the central line of 
sight, and should thus be more sensitive to global cluster 
properties. In particular, AS V probes a unique combi- 
nation of the physical parameters of the hot intra-cluster 
gas, different from those inferred from X-ray observations, 
and from other data. This allows the construction of new 
scaling relations. In this paper, wc examine these scaling 



relations in detail. We are motivated by the forthcoming 
sample of clusters with measured SZ decrements, as well 
as by the strong need for observational tests of the as- 
sumption that clusters can be used as "standard candles" . 
Our goals are (1) to predict scaling relations that involve 
the SZD, together with their scatter, and (2) to study how 
these scaling relations depend on assumptions about the 
cluster structure (i.e. that clusters are standard candles), 
as well as on the underlying cosmological parameters. In 
particular, we quantify these dependencies in the context 
of a well studied cluster sample, such as those expected to 
be available in the near future, for which it will be possible 
to combine SZE and X-ray observations. 

This paper is organized as follows. In § g, we describe 
our cluster model and its input model parameters. In 
§ |3|, we summarize the main observables that the model 
predicts: redshift (z), virial temperature (T), Sunyaev- 
Zeldovich decrement (AS„), and angular size (6). In § |[ 
we show that in this model at a fixed z in (T, AS„, 6) space, 
clusters are expected to be distributed on a "fundamen- 
tal plane" with nearly negligible scatter in the AS), — T 
projection, and that the plane's orientation depends on 
redshift and on the background cosmology. In § pL we pa- 
rameterize deviations from our cluster model caused by 
additional energy input, or by a lack of full virialization, 
and quantify their effects on the scaling relations. Simi- 
larly, in § ^, we quantify the effects on the scaling relations 
caused by varying several cosmological parameters. In § 
we discuss our results and the implications of this work. 
Finally, in § ^[ we summarize our conclusions. 

2. MODEL INGREDIENTS 

2.1. Spherical Collapse 

Generally, galaxy clusters models are based on the col- 
lapse of a spherical top-hat perturbation (e.g., Pccblcsj 
1980). In this model, the average density p v ir enclosed 
within the radius i? V i r of a cluster of total (dark matter 
+ baryon) mass M w - lT that forms at redshift Zf is given by 
(e.g., Kitayama & Suto (1996) hereafter KS96) 



p( z f) 



(1) 



where the factor p(zf) describes departures from a stan- 
dard Einstein de-Sitter universe, and is given by 



where 



p(z f ) w 1 + 0.4093^' 

w f = i/n, - i, 
n (i + zf? 



(2) 
(3) 

(4) 

n {i + z s Y+{i-n Q -n A ){i + z s Y+n A v ' 

In the above equations, p(zf) — ^oPcrit,o(l + z f) 3 is the 
mean matter density at redshift z f , and and Cl\ are the 
present day densities of matter and cosmological constant, 
in units of the critical density p cr it,o- 

We assume that the cluster is isothermal, and that the 
gas acquires the virial temperature of the halo. In this 
case, the temperature is related to the mass and virial ra- 
dius by the virial theorem, 



T 



FL, 



M v 
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where we have assumed that the gas has a mean molecular 
weight of jit = 0.59, appropriate for a fully ionized mixture 
of H and He with a number density ratio of nuo/nu = 0.08. 
Combining equations (0) _ (|^), the temperature of a cluster 
is given in terms of its mass and formation redshift, 



[KeV] 



0.62(1 + 2/) 



[p(zf)] 



1/3 



~n Q h 2 

0.17 



V 10 14 M r . 



1/3 



(- 



2/3 



[KeV] (6) 



This ciqualion qrrantrflra tire luL. uiLloii tfert dusters virl- 

alizing at an earlier epoch are hotter than clusters of the 
same mass that virialize at later times. Following KS96, we 
note that, in fact, clusters of a given mass M observed at 
a redshift z Q form over a range of earlier redshifts Zf > z a . 
While equations (&@) give the temperature and mass of 
a cluster when it first forms and virializes, the cluster is 
subsequently expected to evolve. In principle, both the 
mass and the temperature can change between Zf and z a 



As argued in KS96, nume rical simulations (e.g., Evrard 



I 



olve 



1990, |Navarro et al. 199q) indicate that the temperature 



does not significantly evolve after virialization. We there- 
fore make the assumption here that the final temperature 
of the cluster at redshift z Q is equal to T vu . Below, we 
will consider models that describe departures from this 
assumption (see § 2.3). 

However, clusters do grow in mass after virialization. 
To account for this growth, we assume that the ratio be- 
tween the final mass M and the mass at virialization M v - a 
is a "universal" constant M v i r = /m x M for all clusters 
(see Lacey & Cole 1993; we will relax this assumption be- 
low). Following Viana & Liddle (1996), we set /m = 0.75, 
which results in the best fit to the mass-temperature rela- 
tion (both for its slope and scatter) in cluster simulations. 

2.2. Mass Function and Age Distribution 

The mass and formation redshift of a cluster are not di- 
rectly measurable quantities. Nevertheless, it is possible to 
compute the cluster mass function, dN/dM(M, z Q ) (e.g. 
Press fc Schechter 1974 |Sheth, Mo fc Tormcn 1999|, [Jenk- 



ins ct al. 200C), and also the statistical distribution of the 



formation redshifts Zf of clusters of a give n mass M ob 
served at redshift z a , dN/dzf(M, z ) (e.g., |Lacey fc Cole 
1993| , |5asaki 1994[ ). In what follows, we will use the stan- 
dard Press & Schechter (Press & Schechter 1974; PS) mass 
function, with fitting formulas for the cosmological trans- 
fer function (Sugiyama 1995), and for the critical overden- 
sity 5 c (z) for collapse (KS96): 



S c (z) 



5 (z)(l + z)g(0)/g(z 
3(12tt) 2 /3 



20 



(1 + 0.0123 log 10 %). (7) 



Here (1 + z)/g (z) is the usual linear theory growth factor 
(Peebles 1980). To obtain the formation redshift distri- 
bution for clusters of mass M at redshift z a , we follow 
equation 2.26 in Lacey & Cole (1993; hereafter LC93), 
and define "formation redshift" as the redshift at which 
clusters first acquire a fixed fraction Jm = 0.75 of their 
final mass. 



Our main motivation for the above choices is "techni- 
cal": the convenient semi-analytical derivation of dN/dzf 
is only applicable to the mass function in the standard 
Press-Schechter theory; at present no analogous deriva- 
tion exists for the improved mass functions, such as that 
of Jenkins et al. (2000). We also note that our qualitative 
conclusions below do not crucially depend on the shape 
of the distributions dN/dzj and dN/dM. Nevertheless, it 
is important to emphasize that improvements over the PS 
mass function have already been made, and simulations 
have possibly uncovered differences in the abun dance of 



Here h = Hq/ ''±00 km s Mpc is the Hubble constant. et~al 



massive clu sters (e.g., Seth, Mo & Tormen 1999, Jenkin; 



2000). Our analysis can be straightforwardly gener- 



alized to different choices for the mass function and forma- 
tion redshift distribution, as the latter becomes available. 



2.3. Deviations from Simple Spherical Collapse 

The simple model described above assumes that clusters 
are fully virialized objects whose abundance and structure 
is dictated by gravitational physics alone. Lack of full 
virialization, on-going mergers, or feedback from galaxy 
formation can modify these predictions. For example, 
the above simple scaling relations, together with the as- 
sumption that the X-ray luminosity Lx is dominated by 
Brcmsstrahlung emission, predicts Lx oc T 2 , while obser- 
vations indicate the steeper relation L x oc T 3 (see, e.g 



Kaise r 1991; Evrard & Henry 1991; Bryan & Norman 
1998). Altho ugh different explanations are still possible 
I Bryan 2001) this steep slope suggests additional heat in- 
put to the gas prior to virialization, which preferentially 
lowers the central density, and there fore the X-r a y emis- 



sivity, in smaller clu sters (see, e.g., Kaiser 1991; Bialck 



Evrard & Mohr 2001 and references therein for recent work 



on this subject). 

To model deviations from our simplified model, we here 
introduce two additional parameters, £ and a, by general- 
izing equation (p) as follows: 



T vlr oc (1 + z f ) a M%. 



(8) 



The choice of £ = 1.5 and a = 1 corresponds to the origi- 
nal equation ml), while different values of £ and a describe 
deviations from the simplest model. By design, both of 
these parameters capture departures from purely gravita- 
tional virial equilibrium (cf. eq. |J). The first parameter, 
£, mimics the effect of heat input. Preferential heating 
of small clusters corresponds to flattening the T — M re- 
lation, i.e. to values of £ > 1.5. This t ype of flattening 
has a lready been observed, e.g., $ ~ 1.6 (Mohr & Evrard 
1997[ ), f = 1.72 ( [Muanwong et al. 200"l|; from analy sis of 
cluster simulatio ns), | ~ 1.81 (|Xu, Jin fc Wu 200l|) , and 
£ = 1.98 ± 0.18 QMohr, Mathicscn fc Evrard 1999 ); 7%, 
10%, 20%, and 25% variations respectively. Within the 
context of the spherical top hat model, this discrepancy 
is partly alleviated by the fact that smaller clusters form 
over a broader range of redshifts, and are on average hot- 
ter than larger cluster (which, on average, form closer to 
z = z a ). We find that the effective slope £ c ff, under the 
assumption of /m = 0.75, would be £ e ff ~ 1-6. Neverthe- 
less it is important to model possibly still larger deviations 
from the £ = 1.5 scaling behavior. 
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The second parameter, a, quantifies deviations from the 
assumption that clusters at all redshifts are fully virial- 
ized. It is reasonable to consider, for example, the possi- 
bility that clusters at higher redshifts are more likely to be 
detected in the process of their initial collapse/ assembly, 
and are less likely to be fully virialized than their lower- 
redshift counterparts. This effect is mimicked by a choice 
of a < 1, implying cluster are "colder" at higher redshifts. 
Interestingly, there are examples of clusters hotter than 
one would expect if they were virialized (and can be taken 
as evidence for on-g qing mergers, or some other form of 
energ y injection; e.g., iTucker et al. 1998j, Iflocttigcr, Lokcn| 
& Bui jns 19971 , [Evrard, Mctzler fc Navarro 1996| ); this case 
is mimicked by a choice of a > 1. 

2.4. Modeling the Scatter 

The model described above is "deterministic" : the only 
source of scatter it predicts in the observables is that 
caused by the fact that clusters have a distribution of dif- 
ferent formation redshifts. This scatter is a direct conse- 
quence of cosmological initial conditions (which translates 
into a "scatter" by subsequent gravitational collapse) , and 
probes the high-c r tail of the primordial power spectrum 
( Verde ct al. 2000| ). For example, in the mass-temperature 
or size-temperature relations, this causes a scatter whose 
magnitude monotonically increases with decreasing tem- 
perature. 

It appears that this scatter alone is sufficient to ac- 
count for the scatter in the observed M — T relation 



(Viana & Liddlc 199C, 


dorncr, Mushotzky & Schar: 


1998, Finoguenov, Reipric 


h & Bohringer 2000, Xu, Jin & 


Wu 2001), and, for a suitable choice of erg, for the ob- 



served scatter in the size-temperature relation ( Verde et 



al. 2000, Mohr fe Evrard 1997, Mohr et al. 2000). Never 



thclcss, it is interesting to contrast the predictions of this 
"cosmological" scatter with scatter that can be caused by 
different physics. For example, in a recent analysis of hy- 
drodynamical cluster simulations (Mathiesen 2000), the 
X-ray temperatures do not appear to depend on the for- 
mation redshift (although see Evrard 1990 and Navarro et 
al. 1995, who find that cluster temperatures do not change 
after the cluster forms). These simulations are still in- 
conclusive, since the expected correlation between forma- 
tion redshift and cluster temperature is smaller than the 
scatter due to small-number statistics in the simulations. 
Furthermore, the observational scatter might even be due 
mostly to uncertainties in the mass, temperature (or size) 
determinations. We here nevertheless consider alternative 
origins for the scatter as a possibility. 

We will distinguish two extreme cases as a deterministic 
scenario, in which the cluster temperature depends on the 
formation redshift, with no additional source of scatter; 
and a stochastic scenario, in which the temperature scales 
directly with the observed redshift, and there is some ad- 
ditional source of scatter. In the latter case, the additional 
scatter may simply be observational, or it may be caused 
by processes that influence cluster temperatures, such as 
feedback from galaxy formation, other forms of heating or 
cooling, or simply that /m is not constant from cluster 
to cluster. Regardless of the source of this stochasticity, 
we assume that on average, the X-ray temperature of a 
cluster depends on its mass, redshift of observation and 



cosmology through (||), with the substitution z$ — > z a , 
but that there is, in addition, an intrinsic scatter around 
this mean T(z ) relation. We model this scatter as ran- 
dom deviations in the temperature from cluster to cluster, 
where the deviations AT are Gaussian distributed with a 
fractional r.m.s. width x = AT/T. 

In reality, the scatter seen in cluster scaling relations 
is likely due to a combination of a deterministic effect 
(such as in our deterministic scenario ) and a random vari- 
ation from cluster to cluster (such as in the stochastic sce- 
nario). For example, as we will discuss in § 5.1 below, 
approximately the same scatter in the M — T distribu- 
tion could be obtained within the deterministic scenario 
(with /m = 0.75; x — 0), or from a different scenario 
in which the formation redshift distribution was narrower 
(i.e. /m > 0.75), but the cluster temperature had some 
additional random variation (i.e. x > 0). The parame- 
ter x, together with /m, quantifies how much of the ob- 
served scatter is due to cosmological initial conditions vs. 
other sources. The deterministic scenario corresponds to 
(/m = 0.75; x = 0); the stochastic scenario corresponds to 
(/m = l;x > 0); and in-between cases are described by 
{fu < l;x > 0). 

2.5. Summary of Model Parameters 

In principle, one would like to set all parameters of our 
model to be free, and investigate what joint constraint 
can be imposed from observations. However, the number 
of parameters makes this approach impractical, and we 
are forced to impose constraints using other observations. 
We assume the background cosmology to be a flat A cold 
dark matter (CDM) model (as supported by recent CMB 
data; Mauskopf ct al. 1999 , de Bernardis et al. 2000 , Jaffc 
et al. 2000] ), described by the parameters f2o an d h. Un- 
less otherwise stated, we will assume that the combina- 
tion £loh 2 is a constant (justified by the fact that forth- 
coming CMB experiments will constrain this combination 
to better than 5%). We also assume a baryon fraction 

n b h 2 

(e-g. : 



0.02, consistent wit h recent D/H measurements 



Buries fc Tytler 1998). This leaves h as the only 



free cosmological parameter, and we will further impose 
0.2 < h < 0.9. The primordial power spectrum is specified 
by the normalization as and slope n; however, we here as- 
sume n = 1 (supported by recent CMB data), and, unless 
otherwise stated, we also adopt a 8 = 0.4951V - 60 (in f erred 



from local cluster abundance, Viana & Liddle 1999, Pier 



p aoli, Scott fc White 2000| ). 

In summary, our model is fully specified by the following 
7 parameters: 

• The cosmological parameters Qo,h, and erg; two of 
which can be eliminated by other constraints. 

• The parameters /m and x that describe scatter in the 
scaling relations attributable to cosmological initial 
conditions, and to other sources, respectively 

• The parameters £ and a that describe departures 
from the fully virialized simple top-hat collapse 
models. 

For illustrative purposes we define a "fiducial model" by 
the following choice of parameters: h = 0.65, n h 2 = 0.17, 
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Q = 0.4, cr 8 = 0.86, f M = 0.75, x = 0, £ = 1.5, and 
a = 1 for the deterministic model ; for the stochastic 
model we chose a; = 0.13 and /m = 1 and everything 
else remains unchanged. In addition to these parameters, 
to predict the Sunyaev-Zeldovich decrement (see § || be- 
low) , we need to know the mass fraction of baryons that 
constitute the hot gas in the intra-cluster medium (ICM) . 
This fraction (ficAi) can be inferred from a comparison 
of the clusters X-ray flux and X-ray temperature; here 
we assume ficM ~ 0-2 with a weak te mperature depen- 
dence : ficu = 0.2T a266 (/i/0.5)- 3 / 2 flMohr, Mathieserj 
& Evard 1999). We note here that the value of ficM 
should be a lower limit to the ratio Q(,/f2o; the tempera- 
ture dependence reflects the fact that low mass clusters are 
more likely to loose their gas from energy-injection or pre- 
heating mechanisms, while the dependence on h depends 
on the method employed to measure it. The method of 
Mohr et al. (1999) is a determination for the whole clus- 
ters, while the method of e.g., Grego et al. (2000b) yields 
a different h dependence but is based on the central SZ 
decrement. We have checked that the results we present 
here do not depend qualitatively on the specific choice of 
the dependence of ficM on h. 



3. OBSERVABLES 

The observables of a cluster that we utilize in this paper 
are the redshift (z ), temperature (T), Sunyaev-Zeldovich 
total flux decrement (AS„), and angular size (9). Within 
our model, the redshift and temperature are indepen- 
dent parameters and have already been specified. Cluster 
redshifts can be obtained observationally e.g., from their 
member galaxies out to z Q ~ 1 through deep optical and 
near infrared observations, and cluster temperatures can 
be obtained from X-ray spectra out to similar redshifts. 
Here we define the Sunyaev-Zeldovich decrement (SZD) 
and the angular size. 

The SZD along a line of sight to a cluster in the 



Raylcigh- Je ans regime of the CMB is given by (e.g., Holder 
et al. 200C|) 



Icmb rn e c 2 



dln e (l)T e {l), 



(9) 



where g — —2, m e is the electron rest mass, c is the speed 
of light, ot is the Thomson cross section, Tcmb is the 
CMB temperature, T e is the electron temperature, n e is 
the electron number density, and the integral is along the 
line-of-sight. Expressed in terms of a decrease in the ob- 
served flux, AS„ = 2/cbATcmb^ 2 /c 2 dil (where dQ is the 
smaller of the solid angle of the observations and the solid 
angle subtended by the cluster), equation (0) implies that 
the total observed SZ flux decrement (ASC) for a cluster 
can be related to its total mass via 



AS U 



g2k 2 B v 2 a T TcMB\ , T e M 

/ICM " 



/j, e m e m p c 4 J dA(z) 2 



(10) 



where v is the frequency, /i e = 1.15 is the mean molecular 
weight per electron, m p the proton rest mass, dA is the an- 
gular diameter distance, z is the cluster redshift, /icm is 
the ICM mass fraction and M denotes the mass enclosed 



assumes that the electron density weighted mean temper- 
ature T e can be identified with the virial temperature as 
given in equation (||) (this is satisfied by definition in our 
assumed isothermal clusters; we implicitly assume that T e 
also equals the actual observed X-ray temperature Tx)- 
Finally equation ([l0|) assumes that the solid angle of the 
observations is larger than, or equal to the solid angle sub- 
tended by the cluster. Using equation (||), equation (10) 
can be rewritten in convenient units in terms of the ob- 
servables z and Tx as: 



[MJy] 



0.26(1 + z a ) 



h 



M 



[GHz] 
Tv h 2 



(11) 



ICM 



[10 14 M Q ] [KeV] d 12 



where we have introduced the dimensionless angular diam- 
eter distance d' A = dAHo/c, and z a is the cluster redshift. 
The observable quantities here are z , Tx, and AS^. 

In our model, the angular size of a cluster simply corre- 
sponds to the virial radius, 9 = R viT /dA- The spherically 
averaged profiles (of, e.g., temperature) found in numeri- 
cal simulations do indicate the presence of a virialization 
shock, as expected from the top-hat collapse model. How- 
ever, the shock tends to be weaker, and located at larger 
radii (see, e.g., Bryan & Norman 1998). Nevertheless, we 
here assume that the angular size is given by the virial 
radius, as obtained from the top hat collapse model (see 
further discussion below). 



[deg] / ICM l5.6(l + z )( TG ^ J ) 2 ( T ^ I ) 2 /i 



(12) 



Consistent with our earlier assumption, we have as- 
sumed that the cluster grows in mass by a factor of 
1//a/ = 1-3 between z/ and z . In practice, this angu- 
lar size can be measured accurately only if a cluster is 
resolved, and therefore an estimate of 9 will be available 
only for a fraction of the clusters in a dataset. For example, 
the Planck Surveyo r satellite will resolve only ab out ^ 1% 
of all SZE clusters ( [Kay, Liddle fc Thomas 2001| ), but the 
CSCS, with a 1.7' resolution, will resolve most clusters at 

!])• 



<, 1 (Page et al. 2001 



within the virial radius of the halo. Equation (10) also 



Angular sizes can, in principle, 
also be obtained from clusters resolved in X-ray observa- 
tions. In practice, this involves measuring the profile and 
fitting to a model; this fit will be constrained essentially 
by the core radius, rather than the virial radius (although 
the latter can, of course, be inferred by extrapolation once 
the radial profile was fit to the model) . 

It is likely that in a given sample, not all observables 
will be known for each cluster, and the model prediction 
for some observables (e.g., for the angular size) are less 
robust than for others. Below, we will briefly consider 
the ideal case, in which all four observables (z, T, AS„, 9) 
are reliably measured for a cluster sample. We will then 
concentrate on cases where only the subsets (z,T,AS u ) 
or (z,AS„,9) have been measured. The first of these is 
relevant e.g., to a sample of clusters detected in the inter- 
ferometric SZE survey of Holder et al. (2000), or to the 
MAP/Planck/CSCS cluster sample with X-ray follow-up 
and reliable temperatures; the second case is relevant to 
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a sample of well resolved SZE clusters (e.g., those of the 
CSCS) that are too distant (z J> 1) to obtain e.g., reliable 
temperatures. Current cluster samples exist where a com- 
bination of cluster sizes and temperatures are mea sured, 
and h ave been analyzed in the lit erature (see, e.g., |Mohi 
k EvJard 19971, IVerde et al 



200C) 




Fig. 1. — If clusters are standard candles, they populate a 
hypersurface in the 4 dimensional space of the observables 
(Tx, AS V , 9 and z). Here, we show a slice of this space 
at the constant redshift z a = 0.1. In this slice, clusters are 
constrained to be on a "fundamental plane" . The figure 
also shows the projection of this plane onto the "walls" of 
the box, defining the three scaling relations (AS V — Tx), 
(9 — AS V ), and (9 — Tx)- Not all locations in the funda- 
mental plane are equally likely to be occupied: the filled 
symbols show the distribution of 200 Monte-Carlo gener- 
ated clusters for our fiducial model. Deviations from the 
top-hat collapse model, or different choices of the cosmo- 
logical parameters, cause measurable deformations of the 
FP such as a shift in its position and orientation, and can 
also introduce a random scatter around it. 

4. THE FUNDAMENTAL PLANE OF GALAXY CLUSTERS 
4.1. The Fundamental Plane in the Fiducial Model 

At any given redshift z a , the equations (6), (11) and (12) 
(i.e. in the deterministic scenario) express the three ob- 
servables Tx, AS„ and 9 in terms of the two variables Zf 
and M. For a given frequency channel v of the SZE survey, 
we assume that the ICM mass fraction has the functional 
form described in § 2.5 and that the cosmological param- 
eters (D,Q,h, and erg) and /m are known. These relations 
- three equations between the five variables (Tx, AS U , 9, 
M, and Zf) - allow a study of the distribution of clusters in 
several useful projections of this 5-dimensional parameter 
space. In particular, clusters define a 3-dimensional mani- 
fold in the 4-dimensional subspace of the four observables 
(Tx, AS U , 9, z). It is, however, more practical to consider 
the three dimensional subspace (Tx, AS U , 9), and regard 
the cluster redshift as an evolutionary parameter. In this 
case, for a given redshift z a , clusters can only populate 
a half-plane in (Tx,AS v , #)-space. The restriction to a 
half-plane arises from the requirement that clusters form 
prior to the redshift at which they are observed, Zf > z a . 

In order to visualize this half-plane, which we will here- 
after refer to as the "fundamental plane" (FP), in Figure [l] 



we show an example of the distribution of clusters observed 
at the fixed redshift z Q = 0.1. The figure also shows the 
projection of the FP onto the "walls" of the box, which de- 
fine the three scaling relations (AS V —Tx), (9 — AS V ), and 
(9 — Tx)- Of course, not all locations in the FP are equally 
likely to be occupied: cluster temperatures, virial radii and 
SZ decrements cannot assume any arbitrary value between 
— oo and +oo; smaller clusters are more numerous than 
high temperature ones; and the width of the probability 
distribution of Zf decreases monotonically for larger clus- 
ters. As a result, clusters preferably populate a portion of 
the FP. In Figure |], the filled symbols show the distribu- 
tion of 200 clusters, selected randomly at z a = 0.1 using a 
Monte-Carlo method. 

If all observable quantities Tx, AS U , 9 and z are mea- 
sured in a cluster sample, and the top hat collapse model is 
the correct description of the physical properties of clus- 
ters, then the clusters define the FP shown in Figure [l| 
determined solely by the cosmological parameters. Devia- 
tions from the top-hat collapse model, or different choices 
of the cosmological parameters, can cause a shift in the po- 
sition and orientation of the FP, and can also introduce a 
random scatter around it. We will argue that it is possible 
to quantify such deviations from the simple top-hat model, 
and gain insight into both cluster physics and cosmologi- 
cal parameters, by studying the departures of the FP from 
its position and orientation in our "fiducial" model. As an 
example, a < 1 would modify the dependence of the FP 
on redshift z; while x > would induce a scatter around 
the FP. In the rest of this paper, we will quantify these 
statements, by concentrating on different projections of 
the FP. 



4.2. The Fundamental Plane and Cluster Scaling 
Relations 

As mentioned above, the projections of the FP de- 
fine three scaling relations, (AS V — Tx), (9 — AS V ), and 
(9 — Tx)- The last of these relations can be analyzed using 
existing clust er samples, and have been studied by var ious 
authors (e.g., |Mohr fc Evrard 1997[ |Verde et al. 200C| ). In 
this paper, we focus on the usefulness of the first two scal- 
ing relations, which will be observationally available from 
future SZE cluster surveys. 



1000.00 r 



x 
o 



J4 



in 
<i 



0.01 







M = 3x10 ,5 M„/ [ 




h = 0.9/ 


~7 Z,=5 - 


h = 0.2 






Jyl = 1x10 ,3 M„// 










z o =0.1 - 



0.1 



1.0 
T x [KeV] 



10.0 



Fig. 2. — Theoretical expectations for the projection of 
the fundamental plane defining the (AS V — Tx) scaling 
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relation. The solid line shows the location of clusters in 
our fiducial model at redshift z a = 0.1 with masses be- 
tween 10 13 M < M < 3 x 10 15 M Q , assuming that 
they formed at the redshift at which they are observed, 
Zf = z = 0.1. The dotted arrow shows how the location 
of a 2 x 10 14 Mq cluster depends on its formation redshift 
for 0.1 < Zf < 5. Similarly, the dashed arrow shows how 
the location of this cluster would change with the Hubble 
constant for 0.2 < h < 0.9. Changes in f2o or a% mainly 
affect how the clusters are distributed in the direction of 
the dotted arrow. Note that this arrow is nearly paral- 
lel to the solid line; this suggests that the scatter in the 
(AS V - T x ) should be small. 
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Fig. 3. — Theoretical expectations for the projection of 
the fundamental plane defining the (AS^ — 8) scaling re- 
lation. The solid line, and the dotted and dashed arrows 
describe the same clusters as in Figure ^. Note that in 
difference from Figure @, the dotted arrow, representing 
increasing formation redshifts, is nearly orthogonal to the 
solid line. This suggests that the scatter in the (AS V — 8) 
should be large, and sensitive to f^o, erg and primordial 
non-gaussianity (the same conclusion holds for the size- 
tem perature relation, as can be seen from Figure |l|; see 
also |Verde et al. 200C| ) . 



Figures || and || show the theoretical expectations for 
the two projections of the FP that involve the Sunyaev- 
Zeldovich decrement, together with the expected position 
of clusters in our fiducial model. Figure || shows the pro- 
jection of the FP onto the (A^ — Tx) plane. The solid 
line shows the location of clusters at redshift z = 0.1 
with masses between 10 13 M < M < 3 X 10 15 M in 
this plane, assuming that these clusters formed at the red- 
shift at which they are observed, Zf = z a = 0.1. The 
dotted arrow shows how the location of a 2 x 10 M 
cluster in this plane depends on its formation redshift for 
0.1 < Zf < 5. As the arrow demonstrates, both the tem- 
perature and SZD are increased for clusters that form ear- 
lier. Similarly, the dashed arrow shows how the location 
of this cluster would change with the Hubble constant for 
0.2 < h < 0.9. Both the temperature and SZD decrease 
if the Hubble constant is lower. Note that changes in £Iq 
and erg mainly affect how clusters are distributed in forma- 
tion redshift, i.e. along the direction of the dotted arrow. 
Since this is nearly parallel to the solid line, a change in f^o 
or us is nearly degenerate with a change in cluster mass. 



This suggest that the expected scatter in the (AS,, — Tx) 
relation should be small, and insensitive to Slo and os . No- 
tice also that low values of h make clusters appear colder, 
mimickingan unphysical formation redshift Zf < z Q . 

Figure [| shows the projection of the FP onto the 
(AS V — 8) plane. As in Figure [| the solid line shows 
the location of clusters at redshift z Q — 0.1 with masses 
between 10 13 M < M < 3 X 10 15 M , while the dotted 
and dashed arrows demonstrate the dependence of the lo- 
cation of a 2 x 10 14 M cluster on its formation redshift 
and on the Hubble constant. Note that in difference from 
Figure |[ the dotted arrow, representing increasing forma- 
tion redshifts, is nearly orthogonal to the solid line. This 
suggests that the scatter in the (A^ — 8) should be large, 
and sensitive to Qq, as and primordial non-gaussianity. 
The same conclusion holds for the size-temp erature rela- 
tion, as can be seen from Figure [l] (see also Verde et al 



2000) 



5. PROBING THE INTERNAL PHYSICS OF CLUSTERS 



2.1 



is 



The idealized model for clusters described in 
based on the collapse of a spherical top hat perturbation 
and is the simplest model to relate cluster observables. 
It is important to have observational tests for departures 
from this model; both in order to understand the struc- 
ture of clusters themselves, and also to allow clusters to 
be used as "standard candles" in cosmological studies. As 
discussed above, all four observables (AS U , T, z, 8) might 
not be available for every cluster. This prompts us to con- 
sider what constraints are possible both with and without 
knowing the angular size 8. In this section we will assume 
that cosmological parameters are known and use the defor- 
mations of the FP to test whether clusters are "standard 
candles" . 

5.1. Testing for the Origin of Scatter 

A striking feature of the FP in our fiducial model, shown 
in Figure 0, is its orientation, which implies that the 
three projections have significantly different scatter. In 
particular, a relatively large scatter is predicted around 
the (8 — AS„) and (6 — Tx) scaling relations, while the 
(AS V — Tx) relation remains much more tightly defined. 
Figures || and || reveal that this result follows from the 
deterministic nature of our fiducial model, where the only 
source of scatter in the observables is the distribution in 
formation redshift Zf of clusters (which, ultimately, is a 
direct conseq uenc e of cosmological initial conditions) . As 
discussed in § 2.4 above, the deterministic scenario, with- 



out any additional sources of scatter, might be consis- 
tent with existing data (i.e. the scatter in the 8 — Tx 
and M — Tx relations). Nevertheless, it is interesting to 
consider alternative options, those in-between the two ex- 
treme deterministic and stochastic scenarios. 

The parameters x, and /a/, introduced in § 2.4 above, 
quantify how much of the observed scatter is caused by 
cosmological initial conditions, or by random variations 
from cluster to cluster, arising from their internal physics. 
To quantify this we Monte-Carlo generate the M — T re- 
lation of a mock catalog of about 250 clusters, first in our 
deterministic fiducial model with (/m = 0.75, x = 0), and 
then for different combinations of /m and x. We then 
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compare those to the fiducial model with a 2 dimensional 
Kolm ogorov-Smirnov test (e.g., Press et al. 1992 , Peacock 
1983["| |Fasano fc Franceschini 1987| ). This test computes 
the D statistic, that is the maximum cumulative differ- 
ence between the two distributions, and then gives P>d, 
the probability that D could be greater than observed. 
Small values of P>d indicate that it is extremely likely 
that the two distributions differ, for example a conser- 
vative choice is usually P>d < 0.01 which implies that 
the two data sets are significantly different. We have 
checked with multiple realizations that if two datasets are 
drawn from the same underlying distribution (i.e. our fidu- 
cial model) P>d > 0.317 about 68% of the times and 
P>d > 0.046 about 95% of the times. In what follows we 
will thus plot the equal probability contours corresponding 
to P >D = 0.317, 0.046 and 0.01. 

Figure [| show these equal probability contours. Note 
that the deterministic scenario corresponds to (/m = 
0.75; x = 0); the stochastic scenario corresponds to (/a/ = 
l;x > 0); and in-between cases are described by (/m < 
l;x>0). 
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Fig. 4. — We illustrate a probabilistic study of the ori- 
gin of the scatter in the M — Tx relation. The scat- 
ter can arise from cosmological initial conditions (/a/ = 
0.75, £ = 0), or from a stochastic source (/m = 1). 
Other choices of (/m, x)-values correspond to a combi- 
nation of both effects. The equal-probability contours for 
P >D = 0.317, 0.046, 0.01, where P >D is the likelihood that 
the two data sets are drawn from the same distribution, are 
shown in the Jm~x plane. The likelihoods were obtained 
from a 2D Kolmogorov-Smirnov (KS) test of the M — Tx 
distribution of 250 clusters at redshift z — 0. Probabilities 
are obtained by comparing the M — Tx distributions to 
that in our fiducial flat ACDM model with Jm — 0.75 and 
x = 0. The parameter space outside the light gray area is 
excluded by the KS test at ~ 68% confidence. 

Figure ||has two important implications. First, the pa- 
rameter space outside the light gray area is excluded by 
the KS test at the 68% confidence level (P >D < 0.317) , 
indicating that the M—Tx relation by itself can already be 
a useful discriminant for the source of scatter. Forthcom- 
ing datasets can yield a local sample of many more than 
250 clusters with size, temperature, and mass estimates, in 
which case the constraints shown in Figure ^ can be signif- 



icantly improved. Second, Figure [| suggest that although 
their combination is tightly constrained from the M — Tx 
distribution alone, a degeneracy appears to still remain 
between /m and x. A comparison with Figure [j] suggest 
that this degeneracy can be broken by considering, in ad- 
dition, the distribution of clusters along the AS*,, — Tx 
relation. For models where the scatter is cosmological 
(Jm ~ 0.75), this relation is significantly tighter than the 
9 — Tx distribution. On the other hand, in "stochastic" 
models (/m ~ 1), the scatter in AS V — Tx should be com- 
parable to that in M — Tx ■ 



T [KeV] 
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Fig. 5. — The distribution of 100 Monte Carlo clusters in 
the AS",, — T x plane. Clusters were selected with masses 
above 10 14 /i _1 Mq, observed at redshifts z = 0.1 (dia- 
monds), z a — 0.5 (triangles), and z D — 1 (crosses). The left 
panel corresponds to our fiducial model, in the determinis- 
tic scenario, where the scatter is purely from cosmological 
initial conditions. The right panel shows the stochastic sce- 
nario , with a stochastic scatter. Although both scenarios, 
by construction, have the same scatter in the M — Tx rela- 
tion, the stochastic scenario predicts a significantly larger 
scatter in AS U — Tx than the deterministic one. 

We therefore next examine constraints on (Jm,x) that 
will be available from clusters with measured z and AS U . 
In order to illustrate how the scatter in the A5 V — Tx 
relation differs in the stochastic and deterministic scenar- 
ios, in Figure || we show the distribution of clusters in this 
plane. We used Monte-Carlo realizations of our fiducial 
model to generate a cat alog of ~ 100 cluster s with masses 
above lO 14 /^ 1 M Q (cf. |Holder et al. 2000| ), observed at 
each of the redshifts z a = 0.1, 0.5, and 1. 

The left and right panels of Figure |B| show the distri- 
butions in the deterministic and stochastic scenarios, re- 
spectively. A comparison of the two panels reveals that 
at any redshift, the scatter in the deterministic scenario 
is small; while in the stochastic scenario, it is significantly 
larger. We find that the two scenarios differ increasingly 
towards higher redshifts. This is not surprising, since in 
the deterministic scenario, the scatter is reduced by the 
narrower distribution of cluster ages at high-z, while in 
the stochastic scenario, there is no similar trend. 

In what follows, we find it useful to define a combination 
rj of the observables (AS„, T, z) by 



V 



2.45 x W- 4 {T x /[KeV} 



\5.532 



(1 + z)r> p(z)(A^/[ AlJy /GHz 2 ]/M^ 2 ) 2 



(13) 



Under the assumption that a cluster at redshift z formed 
at the same redshift Zf = z, r\ would simply equal the 
Hubble constant r\ = h. In general, for a sample of clus- 
ters with different ages, r\ > h since Zf > z. As a result, in 
our fiducial model (deterministic scenario), for any given 
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redshift or temperature, the lowest value of 77 (r] m in) in a 
cluster sample gives an estimate of h. In the left panel of 
Figure ^|) , we show the distribution of clusters in the 77 — z 
plane. Inadequacies of our fiducial model will show up as 
a systematic dependence of h (j] m i n ) on z (and/or on Tx] 
see further discussion below). 

We nex t q uantify how the combination 77, defined in 
equation (13t), depends on z and Tjj in different models. 
We generate a Monte-Carlo mock catalog of ~ 300 clus- 
ters between redshift < z < 1 in our fiducial model. 
We impose the restriction z < 1 because of the increasing 
difficulty to obtain accurate cluster redshifts for more dis- 
tant clusters. The survey proposed by Holder et al. (2000) 
will yield about 300 clusters with z £ 1. Similarly, in the 
CSCS survey proposed by Page et al. (2001), about ~ 300 
of the detected SZE clusters will have redshifts measured 
in X-rays and/or optical follow-up observations. For each 
cluster in our mock catalog, we then compute 77, using 
equation (|l3|). 
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Fig. 6. — Scatter plot of 77 from Monte Carlo simula- 
tions of ~ 300 clusters. The true underlying cosmological 
parameters are those for our fiducial model (fio = 0.4, 
Ao = 0.6, h = 0.65). The horizontal dashed line show 
where clusters for which z a — Zf should lie. The left panel 
corresponds to our deterministic fiducial model, while the 
right is for the stochastic model. Note the different scale in 
the y axis and the different distributions of points relative 
to the dashed line. 

Using these Monte-Carlo catalogs, we can now deter- 
mine constraints on (/a/, x). We compare mock catalogs of 
~ 300 clusters, assuming different combinations of (x, /m) 
in otherwise identical models. Instead of using the de- 
terministic scenario for the benchmark model, we adopt 
Im = 0.85 and x — 0.065. This combination lies approx- 
imately in the center of the degenerate parameter range 
shown in Figure ||, and hence it allows us to quantify the 
"distance" of both deterministic and stochastic scenarios 
from this intermediate case. The P>d=0.317, 0.046, and 
0.01 probability contours obtained from these KS tests are 
shown in Figure ^. 
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Fig. 7. — Equal probability contours at P>d=0.317, 0.046 
and 0.01 in the (x, Jm) plane, obtained from 2D KS tests 
between distributions of the clusters in the (77, z) plane. 
Each pair of [x, Jm) was used to generate a mock cata- 
log of 300 clusters, and its (77, z) distribution was com- 
pared to that in a benchmark model with Jm = 0.85 and 
x = 0.065. The parameter space outside the light gray 
area is excluded by the KS test at 68% confidence. The 
two sets of transparent contours show P>d = 0.317 and 
P >D = 0.046 (nominally 68.3% and 95.4% respectively) 
contours obtained by generating mock catalogs down to 
a factor 2 lower value of M m i„ than in the fiducial model 
(left panel); or similarly, assuming the wrong value of h 
by +5% (right panel). Larger errors on M min or h would 
imply that no combination of (x, Jm) produces a (z, 77) 
distribution consistent with our fiducial model. 

The shaded contours (identical on both panels) reveal 
that tight constraints can be derived on a combination of 
(x, Jm), similar, but not identical to those from the M — T 
relation. In using these types of constraints, it is impor- 
tant to compare observational data with model catalogs 
extending down to the "correct" minimum mass, i.e. the 
true mass of the smallest observed cluster. The shaded 
confidence regions were obtained by implicitly assuming 
the right minimum mass (M m i n ) of the survey. To demon- 
strate the sensitivity of this method to the limiting mass, 
in the left panel of Figure [7| the set of transparent con- 
tours shows P>d = 0.317,0.046,0.01, obtained assuming 
a value for M min that is too low by a factor of two. Sim- 
ilarly, the transparent contours in the right panel show 
the results we obtain if we assume the wrong value of h 
by +5%. A larger error on these two parameters would 
results in all combinations of (x, Jm) producing a (z, 77) 
distribution that is inconsistent with the distribution in 
our benchmark model. This conclusion does not change 
if we reverse the sign of the change in M m i n and h. This 
suggests that this type of study can be used to simultane- 
ously constrain (x, Jm), as well as M m - la and h. Our results 
in Figure also suggest that if one marginalizes over the 
allowed ranges of M m ; n and h, the errors on x and Jm will 
not be significantly increased. 



5.2. Cluster Evolution and Feedback 



In § 2.3 we introduced two parameters that describe 
deviations arising from energy injection, or other feed- 
back (£); and from lack of full-virialization, or redshift - 
evolution due to other reasons (a). In our fiducial model, 
£ = 1.5 and a = 1. It is interesting to quantify the effects 
on the cluster scaling relations of different choices for both 



10 



GALAXY CLUSTER SCALING RELATIONS 



parameters. As suggested in the previous section, an inad- 
equacy in our fiducial model will show up as a systematic 
dependence of r\ (eq. 13) on z and/or Tx- For example, 
a 20% variation in the value of £ has a sizeable effect in 
the distribution of r\ vs T, as we demonstrate in the right 
panel of Figure ||. 
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Fig. 8. — Scatter plot of t] from Monte Carlo simula- 
tions of ~ 300 clusters. The true underlying cosmological 
parameters are those for our fiducial model (Oo = 0.4, 
Ao = 0.6, h = 0.65). The horizontal dashed line show 
where clusters for which z Q — Zf should lie. The left panel 
corresponds to our deterministic fiducial model, while in 
the right panel the parameter £ in the scaling relation 
(eq. ^J) has been changed from the fiducial value £ = 1.5 
to £ = 1.8 (diamonds) and £ = 1.2 (crosses). 

In this figure, we show r\ vs. T for £ = 1.5 (left panel), 
and for £ = 1.8 (right panel; diamonds) and £ = 1.2 (right 
panel; crosses), in the fiducial model. 

In order to quantify the minimal level of deviations that 
can be measured in future SZE surveys, we performed a 
2D KS test in the r\ — z distribution for the a param- 
eter using Monte Carlo simulations of 300 clusters. We 
find that deviations from a — 1 by ±0.03 are detectable 
at 68% confidence level. Performing the same test in the 
rj — Tx plane for the £ parameter, we find deviations from 



1.5 



+0.02 
-0.05 



are detectable at the 



the value £ = 1.5 by £ 
same significance. 

If angular sizes of ~ 300 clusters in a survey are avail- 
able, in addition to (AS V ,T, z), the values of x, Jm and 
cosmological parameters can be accurately determined. 
From equation (12) we obtain: 
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Fig. 9.— Equal probability contours at P>d=0.317, 0.046 
and 0.01 in the (x, Jm) plane, obtained from 2D KS tests 
between distributions of the clusters using the combination 
of four observables as in equation (|l4|) as a function of z. 
The combination of these contours with those of Figure 
allows a unique combination of (x, Jm) to be determined. 

These properties of the combination of observables in 
equation (fll) allows a unique combination of (x, Jm) to be 
measured. Figure [9] shows the equal probability contours 
(P>d=0.317, 0.046, 0.01) in the Jm — x plane for a sample 
of 300 clusters: the "product" of the probability contours 
of Figures ^ and allows a unique combination of (x, Jm) 
to be determined! In addition, the rcdshift dependence 
of equation (p~4l) directly probes the redshift evolution of 
the angular-diameter distance, and therefore can be fur- 
ther utilized to measure cosmological parameters, as we 
will discuss in § 6.3. 

6. PROBING COSMOLOGICAL PARAMETERS 

In this section, we will assume that clusters can be used 
as standard candles, i.e. that our modeling of cluster 
physics is accurate. The deformations of the FP relative 
to its shape in the fiducial model can then be used to de- 
termine cosmological parameters. 



h 



[MJy] J M 



)15.6 



(14) 



If we consider this combination of observables as a func- 
tion of z a , we find that (a) the scatter around this relation 
is not sensitive to /m, and hence it can only be due to 
a non-zero value for the parameter x; and (b) the abso- 
lute normalization of the relation depends on the angular 
diameter distance and Jm- In particular, if cosmological 
parameters are known, the latter depends only on /m- 
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Fig. 10. — Equal probability contours in the (h — Sl ) 
plane obtained from the 2D KS test of the 77 — z distribu- 
tion, similar to those shown in Figure ^. The parameter 
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space outside the light gray area is excluded by the KS 
test at the ~ 68% level (P>d = 0.317), the other two con- 
tours correspond to ~ 95% and ~ 99% confidence levels. 
The "true" underlying value is indicated by the cross. The 
dotted and dashed contours show the first two likelihood 
contours (P>d = 0.317 and 0.046), obtained by assuming 
a minimum mass for the survey M m i n two times lower and 
two times higher than the 10 14 /i _1 M used in the mock 
catalog. The estimates of h are not strongly affected by 
the choice of M m [ n . In practice, one will marginalize over 
-Mmin: this will increase the uncertainty in the recovered 
flo, but not in h. The solid line, together with the two 
dashed lines, show the constraint floh 2 =constant, and 
±5% uncertainties, expected from forthcoming CMB ex- 
periments. 



6.1. Constraints from AS U ,T, and z. 

Assuming only AS U and Tx and zq are available, a 2D 
KS test applied to the rj — z distribution (see Figure ^, 
can be used to constrain O, h. It is important to note 
here that this measurement of the Hubble constant, based 
on a combination SZE and X-ray data, is different from 
an existing method proposed by e.g., Gunn (1978), Silk 
& White (1978), Birkinshaw (1979). The latter method, 
by combining the central SZ decrement with X-ray central 
temperature, yields an estimate of the length of the clus- 
ter; assuming that the cluster is spherical this can be used 
as an angular diameter distance test. By averaging over a 
large cluster sample, effects of cluster asphericity can be 
averaged out. In this method clusters are used as standard 
rulers. The method presented here is complementary; it 
uses the total observed SZ flux decrement, which can be 
directly measured, requires data that is easier to obtain 
(no detailed SZ and X-ray map of the cluster are needed) 
and by making use of the whole cluster SZD, should there- 
fore be less sensitive to the details of cluster physics near 
the center. This method relies on different assumptions; it 
assumes clusters are virialized (although deviations from 
virialization can be parameterized, as discussed in § 2.3 
and 2.4). In our method, clusters are used as standard 
candles. The expected statistical errors for a cluster sam- 
ple of the same size are comparable for the two methods, 
but the possible systematics are of entirely different na- 
ture. 

Results of the 2D KS test of the r\ — z distribution of 
O, h are shown in Figure |l0|. The parameter space out- 
side the light gray area is excluded by the KS test at the 
68% level (P>d — 0.317), the other two contours corre- 
spond to P >D = 0.046 and P >D = 0.01. The "true" 
underlying value is indicated by the cross. The dotted 
and dashed contours show the first two likelihood con- 
tours (PyD = 0.317 and 0.046), obtained by assuming a 
minimum mass for the survey M m j n two times lower and 
two times higher than the 10 14 /i _1 M used in the mock 
catalog. The estimates of h are not strongly affected by 
the choice of M m j n . In practice, one will marginalize over 
Mmin: this will increase the uncertainty in the recovered 
but not in h. The solid line, together with the two 
dashed lines, show the constraint floh 2 =constant, and 
±5% uncertainties, expected from forthcoming CMB ex- 
periments. This suggests that the two constraints are suf- 



ficiently different, and can be combined together to break 
the degeneracy between J7q and h. 



6.2. Constraints from AS„, 9, and z. 




etorcmin] 



[ocrmin] 




Fig. 11. — The dependence of the IS.S V — 9 distribution 
on <7g (AS v /v 2 is in units of /iJy/GHz 2 ). The diamonds 
show a Monte Carlo sample of 25 clusters at z = 0.3 with 
9^2' (close to the specifications of the CSCS survey) for 
O = 0.3, A = 0.7, h = 0.65 and a 8 = 0.99; the dots show 
the AS u -9 distribution of ~ 100 Monte-Carlo simulated 
clusters for different choices of as ■ 
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Fig. 12. — Equal probability contours in the as — &o 
plane, obtained from the AS V — 9 relation. The parameter 
space outside the light gray area is excluded at the 68% 
(P>d = 0.317) level by the KS test. The constraint we 
obtain can be approximated by as — 0.92Qq . For ref- 
erence, the solid line shows the constraint in the as — fio 
plane obtained from the local abundance of massive clus- 
ters. A comparison suggests that the two constraints can 
be used together to measure and as simultaneously. 

Assuming that clusters in the sample are resolved, and 
that their angular sizes (0) are measured, a 2D KS test 
applied to the AS 1 ,, — 9 distribution (see Figure [ll]), can 
be used to constrain Qq and as- In Figure [□], we illus- 
trate the dependence of the AS",, — 9 distribution on erg 
(AS v /v 2 is in units of /zJy/GHz 2 ). The diamonds show a 
Monte Carlo sample of 25 clusters at z a = 0.3 with 9 ^ 2' 
for n = 0.4, A = 0.6, h = 0.65 and a 8 = 0.99; the dots 
show the AS u -9 distribution of ~ 100 Monte-Carlo simu- 
lated clusters for different choices of as- It is clear that a 
too large/small as gives rise to a too large/small scatter 
in the (AS V - T) relation. 

The results are demonstrated in Figure [l^, where we 
show the equal probability contours in the as — plane 
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obtained from the AS V — 9 relation. The fiducial model 
is indicated by the cross (here we have dropped the con- 
straint as = 0.495f2,J ' 6 ). As before, the parameter space 
outside the light gray area is excluded at the 68% level 
by the KS test, and the other two sets of contours corre- 
spond to 95% and 99% confidence levels. The constraint 
obtained from the AS V — 9 relation can be approximated 
by a § = 0.92Qq . For comparison, the solid line shows 
the constraint in the as — f^o plane obtained from the lo- 
cal abundance of massive clusters. A comparison of these 
two methods suggests that the two constraints can be used 
together to measure both fig and as- 

6.3. Constraints from AS V ,9, T and z. 

As already mentioned in § 5.2 above, if all observables 
are known for a cluster sample, and if /m and x are rea- 
sonably well constrained, then equation (|14| ) yields the an- 
gular diameter distance as a function of rcdshift. This 
quantity depends on f^o, A, h and the equation of state 
parameterized by w; it can thus be used to constrain these 
parameters. Several other observables have already been 
used to measure the angular diameter distance as a func- 
tion of redshift, such as type la supernovae (e.g., Reiss 
et al. 1998; Perlmutter et al. 1998) or the redshift evo- 
lution of the linear power spectrum shape (e.g., McDon- 
ald & Miralda-Escude 1999; Roukema & Mamon 2000). 
Galaxy clusters potentially offer an independent test. The 
accuracy with which the angular diameter distance can be 
measured will strongly depend on the observational un- 
certainties in AS„,9 and T and will be addressed in a 
subsequent work. 

7. A COMPARISON OF TWO DIFFERENT STATISTICS 

Throughout this paper, we have adopted the 2D KS 
test to quantify the discriminating power of the scaling 
relations between different models. In our study, we have 
focused on the shapes of the scaling relations, and have 
not explored constraints from the total number of detected 
clusters, which itself is a function of cosmological parame- 
ters. The cosmology dependence of the number counts has 
been recently studied by Haiman, Mohr & Holder (2001) 
and Holder, Haiman & Mohr (2001) to forecast precision- 
constraints that will be available on the equation of state 
parameter w, as well as the parameters f2o,^Aj and as- 
These constraints are complementary to those that de- 
rive from the shape of the scaling relations, and the two 
methods will have different systematic errors. For exam- 
ple, the fundamental plane approach requires extensive X- 
ray/optical follow-up of SZ surveys, or requires the tem- 
peratures of SZ selected clusters to be obtained from dif- 
ferent catalogs. This can introduce selection effects which 
affect the number of detected clusters, but not the shape 
of the scaling relations. The KS test has several attractive 
features, such as its distribution independence and the fact 
that is a robust and more direct method. It is a particu- 
larly appropriate statistic for the present application, since 
it distinguishes distributions based only on their shape, 
independent of their overall normalization. Nevertheless, 
one must bear in mind that the KS test is optimized to 
detect "shifts" in distributions, and is less well-suited in 
distinguishing among distributions with significant "tails" 



(e.g. Press et al. 1992). In order to assess the robustness 
of our results to the choice of statistic, we here compare our 
analysis of the constraint on the Hubble constant h with 
constraints obtained from a maximum likelihood method. 
In applying both types of statistic, we utilize the distribu- 
tion of clusters in the (rj — z a ) plane, as defined in § 5.1 
above, to derive constraints on the Hubble constant. 

In general, a maximum likelihood analysis is not directly 
comparable to the KS test, since the former utilizes infor- 
mation from the total number of clusters, while the lat- 
ter is insensitive to it. However, in the case of constrain- 
ing the Hubble constant, we find that the total number 
of clusters is insensitive to the value of h. As an exam- 
ple, in our fiducial model with h = 0.65, we predict, by 
construction, 300 clusters between redshifts 0.1 < z < 1, 
in a solid angle of ~ 20 deg 2 , above the limiting mass 
M min = 10 14 /i _1 M Q . Assuming that the minimum mass 
Mmin scales as M mm oc (e.g. from mass determina- 
tions from X-ray profiles, weak lensing, etc.), and assum- 
ing further that the power spectrum has been indepen- 
dently measured, we find that the total count is essen- 
tially independent of h. Lack of a-priori knowledge of the 
power spectrum introduces a small /i-dependence through 
the "shape parameter" V ~ Qh: we find 297.3 clusters for 
h = 0.64. Finally, assuming a fixed /icm, the mass corre- 
sponding to a constant SZ decrement scales more strongly, 
as -Mmin cx /i~ 8 / 5 (with the /icm dependence used in this 
paper, the dependence would be weaker). Under this scal- 
ing, including the /i-dependence of the power spectrum, we 
find 291.5 clusters for h — 0.64, implying that based on the 
number counts alone, models with h = 0.64 and h = 0.65 
would only be 0.5a apart. We conclude that in the case of 
constraints on h, we can directly compare the likelihood 
and the 2D-KS test performances, both of which will test 
primarily the shapes, rather than the normalizations, of 
the underlying distributions. 

The observation of a discrete number N clusters is a 
Poisson process, the probability of which is given by the 
product 

P = l[(e^eM-ei}/n i y) (15) 
i=i 

where rij is the number of clusters observed in the i th ex- 
perimental bin, and is the expected number in that bin 
in a given model: = I(xi)Sxi- Here, / is proportional to 
the probability distribution, and x denotes the set of ob- 
servables, in the present (rj,z ). Following Cash 
(1979), for unbinned data (or equivalently for very small 
bins that have only or 1 counts) we define the quantity 

N 

C= -21nP = 2(E-^2 ( 16 ) 

i=l 

where N = 300 is the total number of observed clusters 
(i.e. the number of clusters in the mock catalog for our 
fiducial model with h = 0.65), E is the expected number 
of clusters in models with arbitrary h. We have omitted 
all terms that involve dxi since they do not depend on h. 
The quantity AC, defined in two models with different h, 
has a x 2 distribution (e.g. Cash 1979). Both the 2D-KS 
test and the maximum likelihood test are performed in the 
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r) — z a plane, where I(r), z a ) is given by: 

d 2 N 



I(V> z a) 



drjdz 



(17) 



where 
d 2 N 



dr]dz 



dn dV 
drj dz dQ 

drj N ' 

dz f 



-An 



Mn 



(18) 



dN 
dM 



dP dV ^ 
dzf dz dfl 



Here dN/ dM is the Press-Schechter mass function, dP/ dz 
is the formation redshift distribution from Lacey & Cole 
(1993), and dV/dzdQ. is the comoving cosmological vol- 
ume. We have assumed a solid angle AJ7 = 20 deg 2 , and 
included clusters at redshifts 0.1 < z < 1 in our analysis. 
Recall also that 



h 



{\ + Zffp{Zf) 



(1 



(19) 
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Fig. 13. — Comparison of 2D-KS and maximum likelihood 
confidence levels. Since the cluster abundance is insensi- 
tive to h it is possible to directly compare the performances 
of the two tests for a sample of 300 clusters in out fidu- 
cial (deterministic) model. The likelihood is obtained fol- 
lowing equations (jlfj) and (jl9j). The maximum likelihood 
method is much more sensitive that the KS test, but it 
requires good knowledge of the cluster mass function and 
of mask and selection effects of the survey. 

In Figure |l3|, we show the comparison between the like- 
lihoods obtained from AC and the from the 2D-KS con- 
fidence levels, as a function of h. Since the likelihood is 
non-Gaussian, the confidence levels have been obtained by 
integrating the likelihood curve. The figure reveals that 
the likelihood method is more sensitive than the KS test. 
For example, in the KS case, the model with h = 0.64 is 
approximately la away from the fiducial h — 0.65 model, 
while the AC test puts this model at > 2a significance. 
This finding suggests that a maximum likelihood method 
better captures the shape of the rj vs z distribution of clus- 
ters, and can significantly improve on the statistical con- 
straints quoted here. We expect that a similarly improved 



use of the distribution shapes may be possible in other 
cases (e.g., in the measurements of Oo)- However, con- 
straints obtained from a likelihood analysis can be domi- 
nated by the information available from the total cluster 
abundance, rather than the shape. In those cases, and im- 
proved utilization of the shapes can result in only marginal 
improvements of the overall constraints, and, at the same 
time, the use of the maximum likelihood method would 
subject the results to additional systematic errors. A sys- 
tematic study of the best statistic in each case is beyond 
the scope of the present paper, and is deferred to a future 
publication. 



8. DISCUSSION 

In anticipation of extensive data on a large sample of 
galaxy clusters, it is important to analyze cluster prop- 
erties to reduce systematic errors; to test whether physi- 
cal properties of the local cluster sample are preserved at 
high redshift and, if necessary, to model their evolution. 
We present a first attempt to do so self-consistently from 
cluster data. 

The approach presented here assumes that the primor- 
dial fluctuation field was Gaussian. However, clusters are 
rare peaks of the density fluctuation and are therefore 
extremely sensitive to small deviations from gaussianity. 
Nevertheless, this assumption is not crucial: a parame- 
terization of no n-gaussianity could easily be inc luded in 



Silk 1999 



; Matar- 
2000: Verde et 



the model (e.g., Robinsor, Gawiscr 
rese, Verde & Jimenez 2000; Verde et al 
al. 2001). 

We have used four observables: the redshift, the X-ray 
temperature, the total observed SZ flux decrement, and 
the cluster angular size. A particular worry in our ap- 
proach is that we estimate the angular size based on the 
virial radius. Current numerical simulations of cluster for- 
mation (see, e.g., Bryan & Norman 1998) indeed indicate 
the presence of a virialization shock, as expected from the 
top-hat collapse model. However, the shock tends to be 
weaker, and located at larger radii, which can make its 
detection in the SZE maps difficult, or impossible. Clus- 
ter angular sizes can also be estimated from X-ray maps, 
which, however, effectively probe only the core radius. 

Alternatively, similar scaling relations could be obtained 
by considering other observables, in addition to those we 
have considered here. The "fundamental plane" approach 
can be generalized to include, for example, the X-ray lumi- 
nosity, the velocity dispersion, or estimates of the cluster 
masses (e.g. from weak lensing). This opens up many in- 
teresting possibilities. The constraints on clusters physics 
and cosmological parameters are complementary to con- 
straints obtained from other observables; they could thus 
be combined together not only with e.g., CMB constraints 
but also with constraints obtained e.g., from X-ray clusters 
abundances, SZE cluster abundances, analysis of kinetic 
SZE and weak lensing statistics. 



9. CONCLUSIONS 

In view of the advent of high-precision cosmology and 
the expected avalanche of cluster data available with SZ 
experiments in conjunction with X-ray missions, it is vital 
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to be able to extract the maximum amount of information 
form these cluster samples. 

We have investigated the scientific potential of obtaining 
total observed SZ flux decrements (A5„) in the context of 
a well studied cluster sample, for which X-rays and opti- 
cal follow-up are available. In particular, for cosmological 
studies, one would ideally like to use clusters as "standard 
candles" : the use of large future galaxy cluster surveys for 
cosmological studies is likely to be limited by the validity 
of this assumptions, rather than by statistical uncertain- 
ties (Haiman, Mohr & Holder 2001). 

We used a semi-analytic model to study the expected 
distribution of galaxy clusters in redshift (z), virial tem- 
perature (T), Sunyaev-Zeldovich decrement (A5„), and 
angular size (9). In the simplest models, clusters are iden- 
tified with virialized, spherical halos. In this case, at every 
redshift, clusters define a "fundamental plane" (FP) in the 
three dimensional parameter space (T, AS U ,6). The FP 
and its redshift-evolution are sensitive to both the internal 
evolution of clusters, and to the underlying cosmological 
parameters, and can be used to obtain useful constraints 
on both. We have parameterized possible deviations from 
this model to include effects of energy injection or feed- 
back, stochastic scatter in the observables (T, AS V , 0), and 
deviations from virial equilibrium. We have shown that 
their effect is to create measurable deformations in the 
FP. 

We have thus derived predictions for clusters scaling re- 
lations that involve the SZ decrement, and studied how 
these scaling relations depend on assumptions about the 
cluster physics and structure (i.e. on the assumption that 



clusters are standard candles), as well as on the underly- 
ing cosmological parameters. In particular, we find that, 
if clusters are virialized objects, the cluster distribution in 
the AS U — T plane should be narrow. The predicted tight- 
ness of the AS*,, — T relation makes it especially useful 
for quantifying clusters physical properties, possible de- 
viations from virialization, and to detect the presence of 
stochastic scatter in (T, AS V ,6), i.e. it is a useful tool to 
test whether clusters can be used as "standard candles" . 

On the other hand, under the assumption that clusters 
can be used as "standard candles", we show that defor- 
mations of the fundamental plane and of clusters scaling 
relations that involve AS V , can be used to determine cos- 
mological parameters. The constraints on cosmological pa- 
rameters so obtained are complementary to those obtained 
e.g., from CMB primary anisotropics, cluster abundances, 
or clusters central SZ decrements. 

Our results show that the choice of statistic can have a 
significant impact on the derived constraints, at the level 
of a factor of several on the constrained parameters. 
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